搜索资源列表
SegmentRAM
- 1.基于luncene倒排索引格式的高性能索引数据访问接口。 2. 实现若干中文分词的的分词算法。-1. Based on luncene inverted index format of high-performance data access interface Index. 2. A number of Chinese word segmentation algorithm of.
srcfileread_10_31
- 对文件进行操作,对索引倒排表中的文件进行相应的操作-the documents operation, the inverted index table of the corresponding documents to operate
invIndexDoclet
- This code sample shows how to write a simple Javadoc 1.2 Doclet. Used with Javadoc, it can generate a Unix-style inverted index for a set of Java classes, with links to real Java API documentation. Includes directions and source code. This doclet
inverted_index.rar
- 简单的文件倒排实现,搜索引擎实现的步骤之一。大量使用STL,实现简单容易理解。效率一般。,Simple realization of inverted files, search engines to achieve one of the steps. Extensive use of STL, the realization of simple and easy to understand. Efficiency in general.
SearchEngine1.0
- 实现搜索引擎最基本的下载网页、建立倒排索引、关键词查询功能。程序的实现借助了libcurl库。-Search engine to achieve the most basic functionality of downloading page, seting up inverted index, keyword querying. Program implementation with the libcurl library.
hadoop-invertedindex
- 基于Hadoop的反向索引的生成工具。输入一系列文本文件,输出word和其出现的文档和位置-Hadoop-based reverse index generation tool. Enter a series of text files, the output word and its emergence and location of the document
src
- 采用倒排索引做的相似搜索的源码 。 技术点:倒排索引、归一化、k-临近相似计算-Inverted index used to do similar searches for source
VSM
- 向量空间模型算法,给定一个经过分词的文档集,可以输出向量空间模型、特征词典、倒排索引表等功能,很经典的VSM算法源代码-Vector space model algorithm, given a segmentation of the document set, you can output vector space model, the characteristics of dictionaries, inverted index table functions, it is the clas
IndexDemo
- 一个学习数据结构的好例子,通过此代码能够学习倒排索引时如何创建的,本代码在vc6.0下运行通过-A good example of learning data structure, through this code to learn how to create the inverted index, and the code vc6.0 run through! ! !
CluceneTest
- CLucene 例程 是信息检索的初步学习参考代码,可以了解倒排索引的文件结构以及查询方法-CLuene Demo is a reference code for learning information retrival,which can help to learn the inverted index data structure and query method
ir
- 本系统实现了分词和倒排索引,分词采用正向最大匹配,-The system achieved a sub-word and the inverted index, the biggest being the use of sub-word match,
1
- 建立倒排索引的程序雏型.主要采用链表的数据结构.-Inverted index to establish the procedures for the prototype. The main use of the linked list data structure.
mifluz-0.24.0.tar
- mifluz 的目的是提供一个存储倒排索引c++库,允许存放关键词以便事后进行搜索。-The purpose of mifluz is to provide a C++ library to store a full text inverted index. To put it briefly, it allows storage of occurrences of words in such a way that they can later be searched. The basic id
lucene_indexer
- 网页的除噪和预处理,利用lucene建立一个倒排索引,另外利用了HTMLparser对网页的解析进行了优化除噪。-In addition to web pages and pre-noise, using lucene an inverted index, another advantage of HTMLparser analysis on pages optimized denoising.
text_mining
- 本文讨论了在关系数据库下倒排索引的建立、编码、压缩、存储、获取以及并行化等问题。-We focused our discusion on the building,encoding,compressing,storing,retrievingand parallelization of inverted index.
phpsou
- RiSearchPHP是一个高效,功能强大的搜索引擎,特别适用于中小型网站。RiSearchPHP非常快,它能够在不到1秒钟内搜索5000-10000个页面。RiSearch是一个索引搜索引擎,这就意味着它先将你的网站做索引并建立一个数据库来存储你网站所有页面的关键词以便快速搜索。Risearch是全文搜索引擎脚本,它把所有的关键词都编成一个文档索引除了配置文件里面的定义排除的关键词。RiSearch使用经典的反向索引算法(与大型的搜索引擎相同),这就是为什么它会比其它搜索引擎快的原因。-RiS
examples
- i want to build a inverted index
inverted-mapreduce-code
- inverted index mapreduce code-code of inverted index using mapreduce
Inverted-Index-master
- Inverted index file. Very good to sutents, also professors. Stay good to change and optimize this code.
inverted-index
- This for inverted index based on java-This is for inverted index based on java